Abstract

We establish the following result: if the graph of a (nonsmooth) real-extended-valued
function $f : \mathbb{R}^n \to \mathbb{R} \cup \{+\infty\}$ is closed and admits a Whitney stratification, then the norm of the
gradient of $f$ at $x \in \operatorname{dom} f$ relative to the stratum containing $x$ bounds from below all norms
of Clarke subgradients of $f$ at $x$. As a consequence, we obtain some Morse-Sard type theorems
as well as a nonsmooth Kurdyka-Łojasiewicz inequality for functions definable in an arbitrary
o-minimal structure.

1 Introduction

Nonsmoothness in optimization seldom occurs in an arbitrary manner; instead it is often
well-structured, in the sense that a naturally arising manifold $\mathcal{M}$ contains the minimizer and
the function is smooth along this manifold. We refer to [15] for formal definitions, examples and
more details. In the last two decades, several researchers have tried to capture this intuitive idea
in order to develop algorithms with better convergence properties: see for instance the pioneering
work ^3], and also \Tj\ for a recent survey.

In this work we shall be interested in a particular class of well-structured (nonsmooth) functions,
namely functions admitting a Whitney stratification (see Section 2 for definitions). Since
this class contains in particular the semialgebraic and the subanalytic functions (more generally,
functions that are definable in some o-minimal structure over $\mathbb{R}$), the derived results can directly
be applied in several concrete optimization problems involving such structures. Our central idea
is to relate derivative ideas from two distinct mathematical sources: variational analysis and
differential geometry. Specifically, we derive a lower bound on the norms of Clarke subgradients
at a given point in terms of the "Riemannian" gradient with respect to the stratum containing
that point. This is a direct consequence of the "projection formula" given in Proposition 4 and
has as corollaries a Morse-Sard type theorem for Clarke critical points of lower semicontinuous
Whitney stratifiable functions (Corollary 5(ii)) as well as a nonsmooth version of the
Kurdyka-Łojasiewicz inequality for lower semicontinuous definable functions (Theorem 11). Although the
proofs are reasonably routine, analogous results fail for the (broader) convex-stable subdifferential
(introduced and studied in |Hj), unless $f$ is assumed to be locally Lipschitz continuous, see
Remark 9.

As mentioned above, convergence theory for algorithms is one of the main motivations for
this work. In order to treat nonconvex (and nonsmooth) minimization problems, the authors
of [3] introduced an algorithm called the "gradient sampling algorithm". The idea behind this
algorithm is to sample gradients at points near the current iterate and to produce the next
iterate by following the vector of minimum norm in the convex hull generated by the sampled
negative gradients. In the case that the function is locally Lipschitz, the above method can be
viewed as a kind of $\varepsilon$-Clarke subgradient algorithm for which both theoretical and numerical
results are quite satisfactory, see [3]. The convergence of the whole sequence of iterates remains
however an open question, and this is also the case for many classical subgradient methods for
nonconvex minimization. Following the ideas of ^1 and [12], we would hope that the
Kurdyka-Łojasiewicz inequality that we develop ((17) in Section 4) could play a prominent role in the
global convergence of subgradient algorithms.
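The sampling step described above can be sketched in a few lines. This is only an illustrative sketch, not the implementation of [3]: the names `min_norm_in_hull` and `sampled_descent_direction` are hypothetical, sampling is done uniformly in a box rather than a ball, and the minimum-norm element of the convex hull is approximated by a Frank-Wolfe iteration with exact line search instead of a quadratic programming solver.

```python
import random


def min_norm_in_hull(points, iters=200):
    """Approximate the minimum-norm point of conv(points).

    Frank-Wolfe with exact line search (an assumption made for this sketch).
    """
    x = list(points[0])
    for _ in range(iters):
        # Vertex of the hull minimizing the linear form <x, .>.
        s = min(points, key=lambda g: sum(a * b for a, b in zip(x, g)))
        d = [a - b for a, b in zip(x, s)]  # d = x - s
        dd = sum(a * a for a in d)
        if dd == 0.0:
            break
        # Exact minimizer of ||x - gamma * d||^2, clipped to [0, 1].
        gamma = max(0.0, min(1.0, sum(a * b for a, b in zip(x, d)) / dd))
        if gamma == 0.0:
            break
        x = [a - gamma * b for a, b in zip(x, d)]
    return x


def sampled_descent_direction(grad, x, radius, m, rng):
    """Sample m gradients near x and return minus the min-norm hull element."""
    samples = [
        grad([xi + rng.uniform(-radius, radius) for xi in x]) for _ in range(m)
    ]
    g = min_norm_in_hull(samples)
    return [-gi for gi in g]


# Usage on f(x1, x2) = |x1| + 2|x2|, near the kink x2 = 0.
grad = lambda p: [1.0 if p[0] > 0 else -1.0, 2.0 if p[1] > 0 else -2.0]
direction = sampled_descent_direction(grad, [0.5, 0.0], 0.1, 50, random.Random(0))
```

When the samples see both sides of the kink, the minimum-norm element discards the oscillating component of the gradient, which is the stabilizing effect the method relies on.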



2 Preliminaries 

In this section we recall several definitions and results concerning nonsmooth analysis
(subgradients, generalized critical points) and stratification theory. In what follows the vector space
$\mathbb{R}^n$ is endowed with its canonical scalar product $\langle \cdot, \cdot \rangle$.

Nonsmooth analysis. Given an extended-real-valued function $f : \mathbb{R}^n \to \mathbb{R} \cup \{+\infty\}$ we
denote its domain by $\operatorname{dom} f := \{x \in \mathbb{R}^n : f(x) < +\infty\}$, its graph by

$$\operatorname{Graph} f := \{(x, f(x)) \in \mathbb{R}^n \times \mathbb{R} : x \in \operatorname{dom} f\}$$

and its epigraph by

$$\operatorname{epi} f := \{(x, \beta) \in \mathbb{R}^n \times \mathbb{R} : f(x) \le \beta\}.$$

In this work we shall deal with lower semicontinuous functions, that is, functions for which $\operatorname{epi} f$
is a closed subset of $\mathbb{R}^n \times \mathbb{R}$. In this setting, we say that $x^* \in \mathbb{R}^n$ is a Fréchet subgradient of $f$ at
$x \in \operatorname{dom} f$ provided that

$$\liminf_{y \to x,\ y \ne x} \frac{f(y) - f(x) - \langle x^*, y - x \rangle}{\|y - x\|} \ge 0. \tag{1}$$

The set of all Fréchet subgradients of $f$ at $x$ is called the Fréchet subdifferential of $f$ at $x$ and
is denoted by $\hat\partial f(x)$. If $x \notin \operatorname{dom} f$ then we set $\hat\partial f(x) = \emptyset$.
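As a quick illustration of definition (1), here is a worked computation for the absolute value on $\mathbb{R}$ (an example chosen for this sketch, not taken from the text above):

```latex
% Frechet subdifferential of f(x) = |x| at x = 0, directly from (1).
\[
  \liminf_{y \to 0,\ y \neq 0} \frac{|y| - x^{*} y}{|y|}
  \;=\; \min\{\, 1 - x^{*},\; 1 + x^{*} \,\}
  \;=\; 1 - |x^{*}| .
\]
% The quotient equals 1 - x^* for y > 0 and 1 + x^* for y < 0.
% Hence x^* is a Frechet subgradient at 0 iff |x^*| <= 1:
\[
  \hat{\partial} f(0) = [-1, 1].
\]
```

The same computation with $f(x) = -|x|$ gives the quotient $-1 \mp x^*$, whose minimum $-1 - |x^*|$ is always negative, so the Fréchet subdifferential is empty at $0$ in that case.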

Let us give a geometrical interpretation of the above definition: it is well known that the
gradient of a $C^1$ function $f : \mathbb{R}^n \to \mathbb{R}$ at $x \in \mathbb{R}^n$ can be defined geometrically as the vector
$\nabla f(x) \in \mathbb{R}^n$ such that $(\nabla f(x), -1)$ is normal to the tangent space $T_{(x, f(x))} \operatorname{Graph} f$ of (the $C^1$
manifold) $\operatorname{Graph} f$ at $(x, f(x))$, that is,

$$(\nabla f(x), -1) \perp T_{(x, f(x))} \operatorname{Graph} f.$$

A similar interpretation can be stated for Fréchet subgradients. Let us first define the (Fréchet)
normal cone of a subset $C$ of $\mathbb{R}^n$ at $x \in C$ by

$$N_C(x) = \Big\{ v \in \mathbb{R}^n : \limsup_{y \to x,\ y \in C \setminus \{x\}} \Big\langle v, \frac{y - x}{\|y - x\|} \Big\rangle \le 0 \Big\}. \tag{2}$$

Then it can be proved (see [21, Theorem 8.9], for example) that for a nonsmooth function $f$ we
have:

$$x^* \in \hat\partial f(x) \quad \text{if and only if} \quad (x^*, -1) \in N_{\operatorname{epi} f}(x, f(x)). \tag{3}$$

The Fréchet subdifferential extends the notion of a derivative in the sense that if $f$ is differentiable
at $x$ then $\hat\partial f(x) = \{\nabla f(x)\}$. However, it is not completely satisfactory in optimization, since
$\hat\partial f(x)$ might be empty at points of particular interest (think of the example of the function
$f(x) = -\|x\|$ at $x = 0$). Moreover, the Fréchet subdifferential is not a closed mapping, so it is
unstable computationally. For this reason we also consider (see [21, Chapter 8], for example):



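(i) the limiting subdifferential $\partial f(x)$ of $f$ at $x \in \operatorname{dom} f$:

$$p \in \partial f(x) \iff \exists\, (x_n, x_n^*)_{n \in \mathbb{N}} \subset \operatorname{Graph} \hat\partial f : \quad \lim_{n \to \infty} x_n = x, \quad \lim_{n \to \infty} f(x_n) = f(x), \quad \lim_{n \to \infty} x_n^* = p, \tag{4}$$

where $\operatorname{Graph} \hat\partial f := \{(u, u^*) : u^* \in \hat\partial f(u)\}$.

(ii) the asymptotic limiting subdifferential $\partial^\infty f(x)$ of $f$ at $x \in \operatorname{dom} f$:

$$q \in \partial^\infty f(x) \iff \exists\, (y_n, y_n^*)_{n \in \mathbb{N}} \subset \operatorname{Graph} \hat\partial f,\ \exists\, t_n \searrow 0^+ : \quad \lim_{n \to \infty} y_n = x, \quad \lim_{n \to \infty} f(y_n) = f(x), \quad \lim_{n \to \infty} t_n y_n^* = q. \tag{5}$$

When $x \notin \operatorname{dom} f$ we set $\partial f(x) = \partial^\infty f(x) = \emptyset$.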

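The Clarke subdifferential $\partial^\circ f(x)$ of $f$ at $x \in \operatorname{dom} f$ is the central notion of this work. It can
be defined in several (equivalent) ways, see 0]. The definition below (see [19, Theorem 8.11])
is the most convenient for our purposes. (For any subset $S$ of $\mathbb{R}^n$ we denote by $\overline{\operatorname{co}}\, S$ the closed
convex hull of $S$.)

Definition 1 (Clarke subdifferential) The Clarke subdifferential $\partial^\circ f(x)$ of $f$ at $x$ is the set

$$\partial^\circ f(x) = \begin{cases} \overline{\operatorname{co}}\, \{\partial f(x) + \partial^\infty f(x)\}, & \text{if } x \in \operatorname{dom} f, \\ \emptyset, & \text{if } x \notin \operatorname{dom} f. \end{cases} \tag{6}$$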



Remark 1 It can be shown that a formula analogous to (3) also holds for the Clarke
subdifferential, if $N_{\operatorname{epi} f}(x, f(x))$ is replaced by the Clarke normal cone, which is the closed convex
hull of the limiting normal cone. The latter cone comes naturally from the Fréchet normal cone
by closing its graph, see pp. 305, 336] for details.

From the above definitions it follows directly that for all $x \in \mathbb{R}^n$, one has

$$\hat\partial f(x) \subset \partial f(x) \subset \partial^\circ f(x). \tag{7}$$

The elements of the limiting (respectively, Clarke) subdifferential are called limiting
(respectively, Clarke) subgradients.

The notion of a Clarke critical point (respectively, critical value, asymptotic critical value)
is defined as follows.

Definition 2 (Clarke critical point) We say that $x \in \mathbb{R}^n$ is a Clarke critical point of the
function $f$ if

$$\partial^\circ f(x) \ni 0.$$

Definition 3 ((asymptotic) Clarke critical value) (i) We say that $a \in \mathbb{R}$ is a Clarke
critical value of $f$ if the level set $f^{-1}(\{a\})$ contains a Clarke critical point.

(ii) We say that $\lambda \in \mathbb{R} \cup \{\pm\infty\}$ is an asymptotic Clarke critical value of $f$, if there exists a
sequence $(x_n, x_n^*)_{n \ge 1} \subset \operatorname{Graph} \partial^\circ f$, such that

$$f(x_n) \to \lambda \quad \text{and} \quad (1 + \|x_n\|)\, \|x_n^*\| \to 0.$$

Let us make some observations concerning the above definitions: 

Remark 2 (i) Both limiting and Clarke subgradients are generalizations of the usual gradients:
indeed, if $f$ is $C^1$ around $x$ then we have:

$$\partial f(x) = \partial^\circ f(x) = \{\nabla f(x)\}.$$

(ii) The asymptotic limiting subdifferential should not be thought of as a set of subgradients.
Roughly speaking it is designed to detect "vertical tangents" to the graph of $f$. For instance,
for the (nonsmooth) function $f(x) = x^{1/3}$ ($x \in \mathbb{R}$) we have $\partial^\infty f(0) = \mathbb{R}_+$. Note that since the
domain of the Fréchet subdifferential is dense in $\operatorname{dom} f$, we always have $\partial^\infty f(x) \ni 0$, for all
$x \in \operatorname{dom} f$ (see also Corollary 8.10]); therefore, this latter relation cannot be regarded as a
meaningful definition of critical points.

(iii) To illustrate the definition of the Clarke critical point (Definition 2) let us consider the
example of the function $f : \mathbb{R} \to \mathbb{R}$ defined by

$$f(x) = \begin{cases} x, & \text{if } x \le 0, \\ -\sqrt{x}, & \text{if } x > 0. \end{cases}$$

Then $\hat\partial f(0) = \emptyset$ and $\partial f(0) = \{1\}$. However, since $\partial^\infty f(0) = \mathbb{R}_-$ it follows from (6) that
$\partial^\circ f(0) = (-\infty, 1]$, so $x = 0$ is a Clarke critical point.

(iv) It follows from Definition 3 that every Clarke critical value $a \in \mathbb{R}$ is also an asymptotic
Clarke critical value (indeed, given $x_0 \in f^{-1}(\{a\})$ with $0 \in \partial^\circ f(x_0)$, it is sufficient to take
$x_n := x_0$ and $x_n^* = 0$). Note that in case $f$ has a bounded domain $\operatorname{dom} f$, Definition 3(ii)
can be simplified in the following way: the value $\lambda \in \mathbb{R} \cup \{\pm\infty\}$ is critical if, and only if, there
exists a sequence $(x_n, x_n^*)_{n \ge 1} \subset \operatorname{Graph} \partial^\circ f$, such that $f(x_n) \to \lambda$ and $x_n^* \to 0$.
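The three subdifferentials claimed in item (iii) can be recovered step by step from (1), (4), (5) and (6). The following is a sketch of that computation for $f(x) = x$ if $x \le 0$ and $f(x) = -\sqrt{x}$ if $x > 0$:

```latex
% Step 1: Frechet subdifferential at 0. For y > 0 the quotient in (1) is
\[
  \frac{-\sqrt{y} - x^{*} y}{y} \;=\; -\frac{1}{\sqrt{y}} - x^{*}
  \;\longrightarrow\; -\infty \quad (y \searrow 0),
\]
% so no x^* satisfies (1): the Frechet subdifferential at 0 is empty.

% Step 2: gradients at nearby points.
\[
  f'(x) = 1 \ \ (x < 0), \qquad
  f'(x) = -\frac{1}{2\sqrt{x}} \ \ (x > 0).
\]
% Limits of gradients along x_n -> 0 exist only from the left, hence
\[
  \partial f(0) = \{1\}.
\]

% Step 3: asymptotic part. With y_n \searrow 0 and t_n \searrow 0,
\[
  t_n f'(y_n) = -\frac{t_n}{2\sqrt{y_n}}
\]
% can converge to any q <= 0, while t_n * 1 -> 0 from the left, so
\[
  \partial^{\infty} f(0) = \mathbb{R}_{-}.
\]

% Step 4: Clarke subdifferential via (6).
\[
  \partial^{\circ} f(0)
  = \overline{\mathrm{co}}\,\{\, 1 + \mathbb{R}_{-} \,\}
  = (-\infty, 1] \;\ni\; 0 .
\]
```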

Stratification results. By the term stratification we mean a locally finite partition of a
given set into differentiable manifolds, which, roughly speaking, fit together in a regular manner.
Let us give a formal definition of a $C^p$-stratification of a set (for general facts about stratifications
we refer to ^S] or ^Uj and references therein).

Let $X$ be a subset of $\mathbb{R}^n$ and $p$ a positive integer. A $C^p$ stratification $\mathcal{X} = (X_i)_{i \in I}$ of $X$ is a
locally finite partition of $X$ into $C^p$ submanifolds $X_i$ of $\mathbb{R}^n$ such that for each $i \ne j$

$$\overline{X_i} \cap X_j \ne \emptyset \implies X_j \subset \overline{X_i} \setminus X_i.$$

The submanifolds $X_i$ are called strata of $\mathcal{X}$. Furthermore, given a finite collection $\{A_1, \dots, A_q\}$ of
subsets of $X$, a stratification $\mathcal{X} = (X_i)_{i \in I}$ is said to be compatible with the collection $\{A_1, \dots, A_q\}$
if each $A_i$ is a locally finite union of strata $X_j$.

In this work we shall use a special type of stratifications (called Whitney stratifications)
for which the strata are such that their tangent spaces also "fit regularly". To give a precise
meaning to this statement, let us first define the distance (or gap) of two vector subspaces $V$
and $W$ of $\mathbb{R}^n$ by the following standard formula

$$D(V, W) = \max \Big\{ \sup_{v \in V,\ \|v\| = 1} d(v, W),\ \sup_{w \in W,\ \|w\| = 1} d(w, V) \Big\}.$$

Note that

$$\sup_{v \in V,\ \|v\| = 1} d(v, W) = 0 \iff V \subset W.$$

Further we say that a sequence $\{V_k\}_{k \in \mathbb{N}}$ of subspaces of $\mathbb{R}^n$ converges to the subspace $V$ of $\mathbb{R}^n$
(in short, $V = \lim_{k \to +\infty} V_k$) provided

$$\lim_{k \to +\infty} D(V_k, V) = 0.$$

Notice that in this case all the subspaces eventually have the same dimension (say $d$), so that
the above convergence is essentially equivalent to the convergence in the Grassmannian manifold
$G_d^n$ of $d$-dimensional subspaces of $\mathbb{R}^n$.
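To make the gap $D(V, W)$ concrete, here is a small worked example with two lines in $\mathbb{R}^2$ (the specific subspaces are chosen for illustration only):

```latex
% Two lines through the origin in R^2, making an angle theta in [0, pi/2]:
\[
  V = \mathbb{R}\,(1, 0), \qquad
  W_{\theta} = \mathbb{R}\,(\cos\theta, \sin\theta).
\]
% The unit vectors of V are +-(1,0); their distance to the line W_theta is
\[
  d\big((1,0), W_{\theta}\big)
  = \big\| (1,0) - \cos\theta\,(\cos\theta, \sin\theta) \big\|
  = \sin\theta ,
\]
% and by symmetry d((cos theta, sin theta), V) = sin theta as well, so
\[
  D(V, W_{\theta}) = \sin\theta .
\]
% In particular, W_{1/k} -> V as k -> +infinity, since sin(1/k) -> 0.
```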

A $C^p$-stratification $\mathcal{X} = (X_i)_{i \in I}$ of $X$ has the Whitney-(a) property, if for each $x \in \overline{X_i} \cap X_j$
(with $i \ne j$) and for each sequence $\{x_k\}_{k \ge 1} \subset X_i$ we have:

$$\Big( \lim_{k \to \infty} x_k = x \quad \text{and} \quad \lim_{k \to \infty} T_{x_k} X_i = \mathcal{T} \Big) \implies T_x X_j \subset \mathcal{T},$$

where $T_x X_j$ (respectively, $T_{x_k} X_i$) denotes the tangent space of the manifold $X_j$ at $x$ (respectively,
of $X_i$ at $x_k$). In the sequel we shall use the term Whitney stratification to refer to a
$C^1$-stratification with the Whitney-(a) property.



3 Projection formulae for subgradients

In this section we make precise the links between the Clarke subgradients of a function whose
graph (is closed and) admits a Whitney stratification and the gradients of $f$ (with respect to the
strata). As a corollary we obtain a nonsmooth extension of the Morse-Sard theorem for such
functions (see Corollary 5).

Let $f : \mathbb{R}^n \to \mathbb{R} \cup \{+\infty\}$ be an extended-real-valued function with a nonempty closed
domain $\operatorname{dom} f$ (that is, $f$ is lower semicontinuous). We shall deal with Whitney stratifications
$\mathcal{S} = (S_i)_{i \in I}$ of the graph $\operatorname{Graph} f$ of $f$ satisfying for all $i \in I$ and $u \in S_i$ the transversality
condition:

$$e_{n+1} \notin T_u S_i, \tag{$\mathcal{H}$}$$

where

$$e_{n+1} = (0, \dots, 0, 1) \in \mathbb{R}^{n+1}.$$

Remark 3 If $f$ is locally Lipschitz continuous, then it is easy to check that any stratification of
$\operatorname{Graph} f$ must automatically satisfy ($\mathcal{H}$). This might also happen for other functions (think of
the non-locally Lipschitz function $f(x) = \sqrt{|x|}$: every stratification of $\operatorname{Graph} f$ should contain
the stratum $S_i = \{(0, 0)\}$); however, the example of the function $f(x) = x^{1/3}$ shows that this is not
the case for every (continuous stratifiable) function $f$ and every stratification of its graph (consider
the trivial stratification consisting of the single stratum $\operatorname{Graph} f$ and take $u = (0, 0)$).
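The failure of the transversality condition for the trivial stratification of the graph of $x \mapsto x^{1/3}$ can be seen from an explicit parametrization (a short sketch; the parametrization is introduced here for illustration):

```latex
% The graph of f(x) = x^{1/3} is the image of the smooth embedding
\[
  \gamma(t) = (t^{3}, t), \qquad t \in \mathbb{R},
\]
% so it is a C^1 (indeed smooth) submanifold of R^2 with tangent line
\[
  T_{\gamma(t)} \operatorname{Graph} f = \mathbb{R}\,\dot{\gamma}(t)
  = \mathbb{R}\,(3t^{2}, 1).
\]
% At u = gamma(0) = (0,0) the tangent line is vertical:
\[
  T_{(0,0)} \operatorname{Graph} f = \mathbb{R}\,(0, 1) = \mathbb{R}\, e_{2},
\]
% so e_{n+1} = e_2 belongs to T_u S for the single stratum S = Graph f.
% Refining the stratification by the strata {(0,0)}, {gamma(t) : t < 0}
% and {gamma(t) : t > 0} removes the problem: T_{(0,0)}{(0,0)} = {0},
% and for t != 0 the tangent (3t^2, 1) is not vertical.
```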

Let us denote by $\Pi : \mathbb{R}^{n+1} \to \mathbb{R}^n$ the canonical projection on $\mathbb{R}^n$, that is,

$$\Pi(x_1, \dots, x_n, t) = (x_1, \dots, x_n).$$

For each $i \in I$ we set

$$X_i = \Pi(S_i) \quad \text{and} \quad f_i = f|_{X_i}. \tag{8}$$

Due to the above assumptions one has for all $i \in I$

- $X_i$ is a $C^1$ submanifold of $\mathbb{R}^n$,

- $f_i : X_i \to \mathbb{R}$ is a $C^1$ function,

- $\mathcal{X} = (X_i)_{i \in I}$ is a Whitney stratification of $\operatorname{dom} f = \Pi(\operatorname{Graph} f)$.

Notation. In the sequel, for any $x \in \operatorname{dom} f$, we shall denote by $X_x$ (respectively, $S_x$) the
stratum of $\mathcal{X}$ (respectively, of $\mathcal{S}$) containing $x$ (respectively, $(x, f(x))$). The manifolds $X_i$ are
here endowed with the metric induced by the canonical Euclidean scalar product of $\mathbb{R}^n$. Using
the inherited Riemannian structure of each stratum $X_i$ of $\mathcal{X}$, for any $x \in X_i$, we denote by
$\nabla_R f(x)$ the gradient of $f_i$ at $x$ with respect to the stratum $(X_i, \langle \cdot, \cdot \rangle)$.

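Proposition 4 (Projection formula) Let $f : \mathbb{R}^n \to \mathbb{R} \cup \{+\infty\}$ be a lower semicontinuous
function and assume that $\operatorname{Graph} f$ admits a Whitney stratification $\mathcal{S} = (S_i)_{i \in I}$ satisfying ($\mathcal{H}$).
Then for all $x \in \operatorname{dom} f$ we have

$$\operatorname{Proj}_{T_x X_x} \partial f(x) \subset \{\nabla_R f(x)\}, \qquad \operatorname{Proj}_{T_x X_x} \partial^\infty f(x) = \{0\} \tag{9}$$

and

$$\operatorname{Proj}_{T_x X_x} \partial^\circ f(x) \subset \{\nabla_R f(x)\}, \tag{10}$$

where $\operatorname{Proj}_V : \mathbb{R}^n \to V$ denotes the orthogonal projection on the vector subspace $V$ of $\mathbb{R}^n$.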

Proof We shall use the above notation (and in particular the notation of (8)).
Let us first describe the links between the Fréchet subdifferential $\hat\partial f(x)$ and the gradient of
$f|_{X_x}$ at a point $x \in \operatorname{dom} f$. For any $v \in T_x X_x$ and any continuously differentiable curve
$c : (-\varepsilon, \varepsilon) \to X_x$ ($\varepsilon > 0$) with $c(0) = x$ and $\dot c(0) = v$, the function

$$f \circ c \ (:= f_i \circ c) : (-\varepsilon, \varepsilon) \to \mathbb{R}$$

is continuously differentiable. In view of [21, Theorem 10.6, page 427], we have

$$\{ \langle x^*, v \rangle : x^* \in \hat\partial f(x) \} \subset \Big\{ \frac{d}{dt} f(c(t)) \Big|_{t=0} \Big\}.$$

Since $\frac{d}{dt} f(c(t)) \big|_{t=0} = \langle \nabla_R f(x), v \rangle$ it follows that

$$\operatorname{Proj}_{T_x X_x} \hat\partial f(x) \subset \{\nabla_R f(x)\}. \tag{11}$$

In a second stage we prove successively that

$$\operatorname{Proj}_{T_x X_x} \partial f(x) \subset \{\nabla_R f(x)\} \quad \text{and} \quad \operatorname{Proj}_{T_x X_x} \partial^\infty f(x) \subset \{0\}. \tag{12}$$

To this end, take $p \in \partial f(x)$, and let $\{x_k\}_k \subset \operatorname{dom} \hat\partial f$, $x_k^* \in \hat\partial f(x_k)$ be such that $(x_k, f(x_k)) \to
(x, f(x))$ and $x_k^* \to p$. Due to the local finiteness property of $\mathcal{S}$, we may suppose that the
sequence $\{u_k := (x_k, f(x_k))\}_k$ lies entirely in some stratum $S_i$ of dimension $d$.

If $S_i = S_x$ then by (11) we deduce that $\operatorname{Proj}_{T_{x_k} X_x}(x_k^*) = \nabla_R f(x_k)$, thus using the
continuity of the projection and the fact that $f|_{X_x}$ is $C^1$ (that is, $\nabla_R f(x_k) \to \nabla_R f(x)$) we obtain
$\operatorname{Proj}_{T_x X_x}(p) = \nabla_R f(x)$.

If $S_i \ne S_x$, then from the convergence $(x_k, f(x_k)) \to (x, f(x))$ we deduce that $\overline{S_i} \cap S_x \ne \emptyset$
(thus $d = \dim S_i > \dim S_x$). Using the compactness of the Grassmannian manifold $G_d^{n+1}$, we may
assume that the sequence $\{T_{u_k} S_i\}_{k \ge 1}$ converges to some vector space $\mathcal{T}$ of dimension $d$. Then
the Whitney-(a) property yields that $\mathcal{T} \supset T_{(x, f(x))} S_x$. Recalling (3), for each $k \ge 1$ we have that
the vector $(x_k^*, -1)$ is Fréchet normal to the epigraph $\operatorname{epi} f$ of $f$ at $u_k$, hence it is also normal (in
the classical sense) to the tangent space $T_{u_k} S_i$. By a standard continuity argument the vector

$$(p, -1) = \lim_{k \to \infty} (x_k^*, -1)$$

must be normal to $\mathcal{T}$ and a fortiori to $T_{(x, f(x))} S_x$. By projecting $(p, -1)$ orthogonally on $T_x X_x +
\mathbb{R} e_{n+1} \supset T_{(x, f(x))} S_x$, we notice that $(\operatorname{Proj}_{T_x X_x}(p), -1)$ is still normal to $T_{(x, f(x))} S_x$. By the
definition of the subgradient we conclude that

$$\operatorname{Proj}_{T_x X_x}(p) = \nabla_R f(x), \tag{13}$$

thus the first part of (12) follows.

Let now $q \in \partial^\infty f(x)$. By definition there exist $\{y_k\}_k \subset \operatorname{dom} \hat\partial f$, $y_k^* \in \hat\partial f(y_k)$ and
a positive sequence $t_k \searrow 0^+$ such that $(y_k, f(y_k)) \to (x, f(x))$ and $t_k y_k^* \to q$. As above we
may assume that the sequence $\{u_k := (y_k, f(y_k))\}_k$ belongs to some stratum $S_i$ and that the tangent spaces
$T_{u_k} S_i$ converge to some $\mathcal{T}$. Since $t_k (y_k^*, -1)$ is normal to $T_{u_k} S_i$ we can similarly
deduce that $(\operatorname{Proj}_{T_x X_x}(q), 0)$ is normal to $T_{(x, f(x))} S_x$. Since $\operatorname{Proj}_{\mathbb{R}^n \times \{0\}} T_{(x, f(x))} S_x = T_x X_x$ this
implies that $\partial^\infty f(x) \subset (T_x X_x)^\perp$ and the second part of (12) is proved. It now follows from (12)
and Remark 2(ii) that (9) holds.

In order to conclude let us recall (Definition 1) that $\partial^\circ f(x) = \overline{\operatorname{co}}\, (\partial f(x) + \partial^\infty f(x))$. In view
of (12) any element of $\operatorname{co}\, (\partial f(x) + \partial^\infty f(x))$ admits $\nabla_R f(x)$ as a projection onto $T_x X_x$. By
taking the closure of the previous set we obtain (10). □
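The projection formula (10) can be checked numerically on a simple locally Lipschitz example. The sketch below is illustrative only: the function $f(x, y) = |x| + y^2$, the stratum $\{x = 0\}$ and all helper names are assumptions made for this example, and the Clarke subdifferential is computed as the convex hull of the two limiting gradients, which is valid here because $f$ is locally Lipschitz.

```python
def grad_f(x, y):
    """Gradient of f(x, y) = |x| + y**2 away from the line x = 0."""
    return (1.0 if x > 0 else -1.0, 2.0 * y)


def clarke_samples(y0, eps=1e-8, steps=11):
    """Convex combinations of the two limiting gradients at (0, y0)."""
    g_plus = grad_f(eps, y0)
    g_minus = grad_f(-eps, y0)
    out = []
    for k in range(steps):
        t = k / (steps - 1)
        out.append(tuple(t * a + (1 - t) * b for a, b in zip(g_plus, g_minus)))
    return out


def project_onto_tangent(v):
    """Orthogonal projection onto span{(0, 1)}, the tangent space of {x = 0}."""
    return (0.0, v[1])


y0 = 0.3
# Gradient of the restriction y -> y**2 of f to the stratum {x = 0}.
riemannian_grad = (0.0, 2.0 * y0)
subgrads = clarke_samples(y0)
projections = [project_onto_tangent(v) for v in subgrads]
norms = [(v[0] ** 2 + v[1] ** 2) ** 0.5 for v in subgrads]
```

Every sampled Clarke subgradient projects onto the same vector `riemannian_grad`, and consequently none of them has norm smaller than that of `riemannian_grad`.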

Remark 4 The inclusion in (10) may be strict (think of the function $f(x) = -\|x\|^{1/2}$ at $x = 0$,
where $\partial^\circ f(0) = \emptyset$). Of course, whenever $\partial^\circ f(x)$ is nonempty (for example, if $f$ is locally
Lipschitz), under the assumptions of Proposition 4 we have

$$\operatorname{Proj}_{T_x X_x} \partial^\circ f(x) = \{\nabla_R f(x)\}.$$

Corollary 5 Assume that the graph of $f$ is closed and admits a $C^p$-Whitney stratification
satisfying ($\mathcal{H}$). Then:

(i) for all $x \in \operatorname{dom} \partial^\circ f$ we have

$$\|\nabla_R f(x)\| \le \|x^*\| \quad \text{for all } x^* \in \partial^\circ f(x). \tag{14}$$

(ii) (Morse-Sard theorem) If $p \ge n$, then the set of Clarke critical values of $f$ has Lebesgue
measure 0.

Proof Assertion (i) is a direct consequence of (10) of Proposition 4. To prove (ii), set $C :=
[\partial^\circ f]^{-1}(\{0\}) = \{x \in \mathbb{R}^n : \partial^\circ f(x) \ni 0\}$. Since the set of strata is at most countable, the
restrictions of $f$ to each of those yield a countable family $\{f_n\}_{n \in \mathbb{N}}$ of $C^p$ functions. In view of
(14), we have that $C \subset \bigcup_{n \in \mathbb{N}} (\nabla f_n)^{-1}(0)$. The result follows by applying to each $C^p$-function $f_n$
the classical Morse-Sard theorem [22]. □

As we see in the next section, several important classes of lower semicontinuous functions satisfy
the assumptions (thus also the conclusions) of Proposition 4 and of Corollary 5.

4 Kurdyka-Łojasiewicz inequalities for o-minimal functions

Let us recall briefly a few definitions concerning o-minimal structures (see for instance, Coste [5],
van den Dries-Miller [H], Ta Le Loi [21], and references therein).

Definition 6 (o-minimal structure) An o-minimal structure on $(\mathbb{R}, +, \cdot)$ is a sequence of
Boolean algebras $\mathcal{O}_n$ of "definable" subsets of $\mathbb{R}^n$, such that for each $n \in \mathbb{N}$

(i) if $A$ belongs to $\mathcal{O}_n$, then $A \times \mathbb{R}$ and $\mathbb{R} \times A$ belong to $\mathcal{O}_{n+1}$;

(ii) if $\Pi : \mathbb{R}^{n+1} \to \mathbb{R}^n$ is the canonical projection onto $\mathbb{R}^n$ then for any $A$ in $\mathcal{O}_{n+1}$, the set
$\Pi(A)$ belongs to $\mathcal{O}_n$;

(iii) $\mathcal{O}_n$ contains the family of algebraic subsets of $\mathbb{R}^n$, that is, every set of the form

$$\{x \in \mathbb{R}^n : p(x) = 0\},$$

where $p : \mathbb{R}^n \to \mathbb{R}$ is a polynomial function;

(iv) the elements of $\mathcal{O}_1$ are exactly the finite unions of intervals and points.

Definition 7 (definable function) Given an o-minimal structure $\mathcal{O}$ (over $(\mathbb{R}, +, \cdot)$), a
function $f : \mathbb{R}^n \to \mathbb{R} \cup \{+\infty\}$ is said to be definable in $\mathcal{O}$ if its graph belongs to $\mathcal{O}_{n+1}$.

Remark 5 At first sight, o-minimal structures might appear artificial in optimization. The
following properties (see |Hj for the details) might convince the reader that this is not the case.

(i) The collection of semialgebraic sets is an o-minimal structure. Recall that semialgebraic
sets are Boolean combinations of sets of the form

$$\{x \in \mathbb{R}^n : p(x) = 0,\ q_1(x) < 0, \dots, q_m(x) < 0\},$$

where $p$ and the $q_i$'s are polynomial functions on $\mathbb{R}^n$.

(ii) There exists an o-minimal structure that contains the sets of the form

$$\{(x, t) \in [-1, 1]^n \times \mathbb{R} : f(x) = t\},$$

where $f : \mathbb{R}^n \to \mathbb{R}$ is real-analytic around $[-1, 1]^n$.

(iii) There exists an o-minimal structure that contains simultaneously the graph of the
exponential function $\mathbb{R} \ni x \mapsto \exp x$ and all semialgebraic sets (respectively, the structure defined
in (ii)).

Let us finally recall the following important fact: the composition of mappings that are
definable in some o-minimal structure remains in the same structure [HI Section 2.1]. This is
true for the sum, the inf-convolution and several other classical operations of analysis involving
a finite number of definable objects. This remarkable stability, combined with new techniques of
finite-dimensional optimization, offers a large field of investigation. Several works have already
been developed in this spirit, see for instance pQ, [2].

Given any o-minimal structure $\mathcal{O}$ and any lower semicontinuous definable function $f : \mathbb{R}^n \to
\mathbb{R} \cup \{+\infty\}$ the assumptions of Proposition 4 are satisfied. More precisely, we have the following
result.

Lemma 8 Let $\mathcal{B} := \{B_1, \dots, B_p\}$ be a collection of definable subsets of $\mathbb{R}^n$. Then there
exists a definable $C^p$-Whitney stratification $\{S_1, \dots, S_\ell\}$ of the graph $\operatorname{Graph} f$ of $f$ satisfying the
transversality condition ($\mathcal{H}$) and yielding (by projecting each stratum $S_i \subset \mathbb{R}^{n+1}$ onto $\mathbb{R}^n$) a
$C^p$-Whitney stratification $\{X_1, \dots, X_\ell\}$ of the domain $\operatorname{dom} f$ of $f$ compatible with $\mathcal{B}$.

Proof Let $\{\sigma_1, \dots, \sigma_m\}$ be a $C^1$-stratification of the definable set $\operatorname{Graph} f$ and set $E_i = \Pi(\sigma_i)$
and $Y_{ij} = E_i \cap B_j$ for $1 \le i \le m$ and $1 \le j \le p$. Since the mapping $\Pi : \sigma_i \to E_i$ is an open
continuous one to one mapping, its inverse $E_i \ni x \mapsto (x, f(x)) \in \sigma_i$ is continuous and thus so
is $f|_{E_i}$. Since the restriction of $f$ to each definable set $Y_{ij}$ is continuous, there exists a
$C^1$-stratification $\{Z_{i,j,k}\}_{1 \le k \le q_{ij}}$ of each definable set $Y_{ij}$ such that $f|_{Z_{i,j,k}}$ is $C^1$ (see Theorem
6.7], HD| or jUj for example). Set

$$A_{i,j,k} = \{(x, f(x)) : x \in Z_{i,j,k}\}$$

and consider a $C^p$-Whitney stratification $\mathcal{S} = \{S_1, \dots, S_\ell\}$ of the definable set $\operatorname{Graph} f$
compatible with the definable sets $A_{i,j,k}$ (see [24, Theorem 1.3], for example). It is easily seen that
$\mathcal{S}$ satisfies the transversality condition ($\mathcal{H}$). Indeed, since $\operatorname{dom} f = \bigcup_{i,j,k} Z_{i,j,k}$, the relation
$e_{n+1} \in T_{(x, f(x))} S_i$ for some $x \in \operatorname{dom} f$ belonging, say, to the stratum $Z_{i,j,k}$ would contradict
the differentiability of $f|_{Z_{i,j,k}}$ at $x$. Setting $X_i = \Pi(S_i)$, it is easily seen that the obtained
$C^p$-Whitney stratification $\mathcal{X} = \{X_1, \dots, X_\ell\}$ of $\operatorname{dom} f$ is compatible with the collection $\mathcal{B}$. □

Remark 6 The aforementioned result can also be obtained by invoking more delicate results
on stratification of functions ([23], for example). We give an elementary proof for the reader's
convenience.

Corollary 9 Let $f : \mathbb{R}^n \to \mathbb{R} \cup \{+\infty\}$ be a lower semicontinuous definable function. There
exists a finite definable Whitney stratification $\mathcal{X} = (X_i)_{i \in I}$ of $\operatorname{dom} f$ such that for all $x \in \operatorname{dom} f$

$$\operatorname{Proj}_{T_x X_x} \partial^\circ f(x) \subset \{\nabla_R f(x)\}. \tag{15}$$

As a consequence

(i) For all $x \in \operatorname{dom} \partial^\circ f$ and $x^* \in \partial^\circ f(x)$, we have $\|\nabla_R f(x)\| \le \|x^*\|$;

(ii) The set of Clarke critical values of $f$ is finite;

(iii) The set of asymptotic Clarke critical values of $f$ is finite.

Proof Assertion (i) is a direct consequence of (15). This projection formula follows directly
by combining Lemma 8 with Proposition 4. To prove (iii), let $f_i$ be the restriction of $f$ to the
stratum $X_i$. Then assertion (i), together with the fact that the number of strata is finite, implies
that the set of the asymptotic Clarke critical values of $f$ is the union (over the finite set $I$) of
the asymptotic critical values of each (definable $C^1$) function $f_i$. Thus the result follows from
Remarque 3.1.5]. Assertion (ii) follows directly from (iii) (cf. Remark 2(iv)). □

Remark 7 The fact that the set of the asymptotic critical values of a definable differentiable
function $f$ is finite has been established in [BJ Theoreme 3.1.4] (see also [13, Theorem 3.1] for the
case that the domain of $f$ is bounded). In [12, Proposition 2] a more general result (concerning
functions taking values in $\mathbb{R}^k$) has been established in the semialgebraic case.

Before we proceed, let us recall from Kurdyka [12, Theorem 1] the following result:

Theorem 10 (Kurdyka-Łojasiewicz inequality) Let $f : U \to \mathbb{R}_+$ be a definable
differentiable function, where $U$ is an open and bounded subset of $\mathbb{R}^n$. Then there exist $\rho, c > 0$ and a
strictly increasing definable function $\psi : (0, \rho) \to (0, +\infty)$ of class $C^1$ such that

$$\|\nabla(\psi \circ f)(x)\| \ge c, \quad \text{for each } x \in U \cap f^{-1}(0, \rho). \tag{16}$$

Remark 8 Let us observe that in the conclusion of the above result, there is no loss of generality
in assuming $c = 1$ and $\psi$ defined and continuous on $[0, \rho)$ with $\psi(0) = 0$. Moreover, a careful
examination of the proof of [12, Theorem 1] shows that the result of Theorem 10 remains valid
if $U$ is any nonempty bounded definable submanifold of $\mathbb{R}^n$.

We shall use Corollary 9 to extend Theorem 10 to a nonsmooth setting.

Theorem 11 (Nonsmooth Kurdyka-Łojasiewicz inequality) Let $f : \mathbb{R}^n \to \mathbb{R} \cup \{+\infty\}$ be
a lower semicontinuous definable function and $U$ be a bounded definable subset of $\mathbb{R}^n$. There
exist $\rho > 0$ and a strictly increasing continuous definable function $\psi : [0, \rho) \to [0, +\infty)$ which is
$C^1$ on $(0, \rho)$ with $\psi(0) = 0$ and such that for all $x \in U \cap |f|^{-1}(0, \rho)$ and all $x^* \in \partial^\circ f(x)$

$$\|x^*\| \ge \frac{1}{\psi'(|f(x)|)}. \tag{17}$$
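Two elementary instances may help to read inequality (17). Both functions below are semialgebraic, hence definable; the choices of $\psi$ are made for this illustration and are not claimed to be the ones produced by the proof.

```latex
% Smooth case: f(x) = ||x||^2 on R^n, U the open unit ball, psi(s) = sqrt(s).
\[
  \psi'(s) = \frac{1}{2\sqrt{s}}, \qquad
  \frac{1}{\psi'(|f(x)|)} = 2\,\|x\| = \|\nabla f(x)\| ,
\]
% so (17) holds with equality for every x != 0.

% Nonsmooth case: f(x) = ||x||, psi(s) = s. For x != 0,
\[
  \partial^{\circ} f(x) = \Big\{ \frac{x}{\|x\|} \Big\}, \qquad
  \|x^{*}\| = 1 = \frac{1}{\psi'(|f(x)|)} ,
\]
% so (17) again holds with equality on |f|^{-1}(0, rho) for any rho > 0.
```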

Proof Set $U_1 = \{x \in U \cap \operatorname{dom} f : f(x) > 0\}$ and $U_2 = \{x \in U \cap \operatorname{dom} f : f(x) < 0\}$
and let $X_1, \dots, X_\ell$ be a finite definable stratification of $\operatorname{dom} f$ compatible with the bounded
(definable) sets $U_1$ and $U_2$ such that the definable sets $S_i = \{(x, f(x)) : x \in X_i\}$ are the strata
of a definable $C^p$-Whitney stratification of $\operatorname{Graph} f$ satisfying ($\mathcal{H}$) (cf. Lemma 8). For each
$i \in \{1, \dots, \ell\}$ such that $X_i \subset U_1$ we consider the positive $C^1$ function $f_i := f|_{X_i}$ on the definable
manifold $X_i$ (thus for $x \in X_i$ we have $\nabla f_i(x) = \nabla_R f(x)$ and $f_i(x) = f(x)$) and we apply
Theorem 10 (and Remark 8) to obtain $\rho_i > 0$ and a strictly increasing definable $C^1$-function
$\psi_i : (0, \rho_i) \to (0, +\infty)$ such that for all $x \in f_i^{-1}(0, \rho_i)$ we have $\|\nabla_R f(x)\| \ge [\psi_i'(f(x))]^{-1}$.
Similarly, for each $j \in \{1, \dots, \ell\}$ such that $X_j \subset U_2$ we consider the positive $C^1$ function
$f_j := -f|_{X_j}$ (note that for $x \in X_j$ we have $\nabla f_j(x) = -\nabla_R f(x)$ and $f_j(x) = -f(x)$) to obtain
as before $\rho_j > 0$ and a strictly increasing definable $C^1$-function $\psi_j : (0, \rho_j) \to (0, +\infty)$ such that
for all $x \in f_j^{-1}(0, \rho_j)$ we have $\|\nabla_R f(x)\| \ge [\psi_j'(-f(x))]^{-1}$. Thus for all $i \in \{1, \dots, \ell\}$ there exist
$\rho_i > 0$ and a strictly increasing definable $C^1$-function $\psi_i : (0, \rho_i) \to \mathbb{R}$ such that

$$\|\nabla_R f(x)\| \ge \frac{1}{\psi_i'(|f(x)|)} \quad \text{for all } x \in U \cap X_i \cap |f|^{-1}(0, \rho_i).$$

Set $\rho = \min_i \rho_i$ and let $i_1, i_2 \in \{1, \dots, \ell\}$. By the monotonicity theorem for definable functions
of one variable (see [12, Lemma 2], for example), the definable function

$$(0, \rho) \ni r \mapsto \frac{1}{\psi_{i_1}'(r)} - \frac{1}{\psi_{i_2}'(r)}$$

has a constant sign in a neighborhood of 0. Repeating the argument for all couples $i_1, i_2$ and
shrinking $\rho$ if necessary, we obtain the existence of a strictly increasing, positive, definable
function $\psi = \psi_{i_0}$ (for some index $i_0$) on $(0, \rho)$ of class $C^1$ that satisfies $1/\psi' \le 1/\psi_i'$ on $(0, \rho)$ for all $i \in \{1, \dots, \ell\}$.
Invoking Corollary 9(i), we obtain for all $x \in U \cap |f|^{-1}(0, \rho)$ and all $x^* \in \partial^\circ f(x)$

$$\|x^*\| \ge \|\nabla_R f(x)\| \ge \frac{1}{\psi'(|f(x)|)}.$$

Since $\psi$ is definable and bounded from below, it can be extended continuously to $[0, \rho)$. By
adding a constant if necessary, we can also assume $\psi(0) = 0$. □

The assumption that the function $f$ is definable is important for the validity of (17). It
implies in particular that the connected components of the set of the Clarke critical points of $f$
lie in the same level set of $f$ (cf. Corollary 9(ii)). Let us present some examples of $C^1$-functions
for which (17) is not true.

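Example 1 (i) Consider the function $f : \mathbb{R} \to \mathbb{R}$ with

$$f(x) = \begin{cases} x \sin \frac{1}{x}, & \text{if } x \ne 0, \\ 0, & \text{if } x = 0. \end{cases}$$

Then the set $S = \{x \in \mathbb{R} : f'(x) = 0\}$ meets infinitely many level sets. Consequently, (17) is not
fulfilled.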

(ii) A nontrivial example is proposed in [20, page 14], where a $C^\infty$ "Mexican-hat" function has
been defined. An example of a similar nature has been given in pQ, and will be described below.
Let $f$ be defined in polar coordinates on $\mathbb{R}^2$ by

$$f(r, \theta) = \begin{cases} \exp\Big(-\dfrac{1}{1 - r^2}\Big) \Big[ 1 - \dfrac{4 r^4}{4 r^4 + (1 - r^2)^4} \sin\Big(\theta - \dfrac{1}{1 - r^2}\Big) \Big], & \text{if } r < 1, \\ 0, & \text{if } r \ge 1. \end{cases}$$

The function $f$ does not satisfy the Kurdyka-Łojasiewicz inequality for the critical value 0, i.e.
one cannot find a strictly increasing $C^1$ function $\psi : (0, \rho) \to (0, +\infty)$, with $\rho > 0$, such that

$$\|\nabla(\psi \circ f)(x)\| \ge 1$$

for small positive values of $f(x)$. To see this, let us notice that the proof of [12, Theorem
2] shows that for any $C^1$ function $f$ (not necessarily definable) that satisfies the
Kurdyka-Łojasiewicz inequality, the bounded trajectories of the gradient system

$$\dot x(t) + \nabla f(x(t)) = 0$$

have a bounded length. However, in the present example, taking as initial condition $r_0 \in (0, 1)$
and $\theta_0$ such that $\theta_0 (1 - r_0^2) = 1$, the gradient trajectory $\dot x(t) = -\nabla f(x(t))$ must comply with

$$\theta(t) = \frac{1}{1 - r(t)^2},$$

where $r(t) \nearrow 1$ as $t \to +\infty$ (see for details). The total length of the above curve is obviously
infinite, which shows that the Kurdyka-Łojasiewicz inequality (for the critical value 0) does not
hold.



Let us finally give an easy consequence of Theorem 11 for the case of subanalytic functions.

Corollary 12 (Subgradient inequality) Assume that $f : \mathbb{R}^n \to \mathbb{R} \cup \{+\infty\}$ is a lower
semicontinuous globally subanalytic function and $f(x_0) = 0$. Then there exist $\delta > 0$ and $\theta \in [0, 1)$
such that for all $x \in |f|^{-1}(0, \delta)$ we have

$$|f(x)|^\theta \le \|x^*\| \quad \text{for all } x^* \in \partial^\circ f(x).$$

Proof In case that $f$ is globally subanalytic, one can apply [12, Theorem (LI)] to deduce that
the continuous function $\psi$ of Theorem 11 can be taken of the form $\psi(s) = s^{1 - \theta}$ with $\theta \in (0, 1)$.
□
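A subgradient inequality of this type is easy to test numerically on a smooth polynomial. The sketch below is illustrative only: the function $f(x, y) = x^2 + y^4$, the exponent $\theta = 3/4$ and the grid on $[-1, 1]^2$ are assumptions chosen for this example, not data from the text.

```python
def f(x, y):
    """Illustrative polynomial with an isolated critical point at the origin."""
    return x * x + y ** 4


def grad_norm(x, y):
    """Euclidean norm of the gradient (2x, 4y^3) of f."""
    return ((2.0 * x) ** 2 + (4.0 * y ** 3) ** 2) ** 0.5


theta = 0.75  # assumed exponent for this example
n = 40
violations = 0
for i in range(-n, n + 1):
    for j in range(-n, n + 1):
        x, y = i / n, j / n  # grid point in [-1, 1]^2
        if (x, y) == (0.0, 0.0):
            continue  # the inequality is only required where f > 0
        # Check |f|^theta <= ||grad f|| up to rounding.
        if f(x, y) ** theta > grad_norm(x, y) + 1e-12:
            violations += 1
```

On this grid the counter `violations` stays at zero, in line with the elementary estimate $(x^2 + y^4)^{3/4} \le |x|^{3/2} + |y|^3 \le |x| + 2|y|^3 \le \|\nabla f(x, y)\|$ for $|x| \le 1$.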



Remark 9 Corollary 5(ii) (and a fortiori Corollary 12) extends [2, Theorem 7] to the lower
semicontinuous case. We also remark that the conclusions of Theorem 11 and of Corollary 9
remain valid for any notion of subdifferential that is included in the Clarke subdifferential, thus,
in particular, in view of (7), for the Fréchet and the limiting subdifferential. However, let us
point out that this is not the case for broader notions of subdifferentials, as for example the
convex-stable subdifferential introduced and studied in [3]. It is known that the convex-stable
subdifferential coincides with the Clarke subdifferential whenever the function $f$ is locally
Lipschitz continuous, but it is strictly larger in general, creating more critical points. In particular,
[2, Section 4] constructs an example of a subanalytic continuous function on $\mathbb{R}^3$ that is strictly
increasing in a segment lying in the set of its broadly critical points (that is, critical in the sense
of the convex-stable subdifferential). Consequently, Theorem 11 and Corollary 12 do not hold
for this subdifferential.